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Abstract 

The effectiveness of screening programs for cervical cancer has benefited from 
the inclusion of Human papillomavirus (HPV) DNA assays; which assay to 
choose, however, is not clear based on previous reviews. Our review addressed 
test accuracy of Hybrid Capture II (HCII) and polymerase chain reaction 
(PCR) assays based on studies with stronger designs and with more clinically 
relevant outcomes. We searched OvidMedline, PubMed, and the Cochrane 
Library for English language studies comparing both tests, published 1985- 
2012, with cervical dysplasia defined by the Bethesda classification. Meta-analysis 
provided pooled sensitivity, specificity, and 95% confidence intervals (CIs); 
meta-regression identified sources of heterogeneity. From 29 reports, we found 
that the pooled sensitivity and specificity to detect high-grade squamous intra- 
epithelial lesion (HSIL) was higher for HCII than PCR (0.89 [CI: 0.89-0.90] 
and 0.85 [CI: 0.84-0.86] vs. 0.73 [CI: 0.73-0.74] and 0.62 [CI: 0.62-0.64]). 
Both assays had higher accuracy to detect cervical dysplasia in Europe than in 
Asia-Pacific or North America (diagnostic odd ratio - dOR = 4.08 [CI: 1.39- 
11.91] and 4.56 [CI: 1.86-11.17] for HCII vs. 2.66 [CI: 1.16-6.53] and 3.78 [CI: 
1.50-9.51] for PCR) and accuracy to detect HSIL than atypical squamous cells 
of undetermined significance (ASCUS)/ low-grade squamous intraepithelial 
lesion (LSIL) (HCII-dOR = 9.04 [CI: 4.12-19.86] and PCR-dOR = 5.60 [CI: 
2.87-10.94]). For HCII, using histology as a gold standard results in higher 
accuracy than using cytology (dOR = 2.87 [CI: 1.31-6.29]). Based on higher 
test accuracy, our results support the use of HCII in cervical cancer screening 
programs. The role of HPV type distribution should be explored to determine 
the worldwide comparability of HPV test accuracy. 



Introduction 

Cervical cancer is a significant cause of morbidity and 
mortality among women worldwide [ 1 ] . Human papilloma- 
virus (HPV) infection is one of the most common sexu- 
ally transmitted diseases in the world, and infection with 



high-risk oncogenic types of HPV has been recognized as 
a necessary cause of cervical cancer and its precursor 
lesion, cervical intraepithelial neoplasia (CIN) [2, 3]. For- 
tunately, preventing cervical cancer is possible due to its 
distinct premalignant stage, and as the introduction of 
population-based screening programs, cervical cancer 
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incidence and mortality have greatly decreased in devel- 
oped countries [4-6]. Screening has largely relied on 
cytology-based tests; however, given their subjective nat- 
ure as well as low sensitivity and specificity, adding HPV 
DNA testing to screening programs to improve their effi- 
cacy has been proposed [7-9]. Furthermore, the positive 
predictive value (PPV) of current screening tests is pro- 
jected to decrease in populations vaccinated against HPV, 
but this drop in test performance could be mitigated by 
adding HPV DNA testing to the screening paradigm [10]. 
There are several ways in which HPV DNA testing might 
be implemented. First, the HPV DNA assay may be used, 
either in combination with cytology or alone, as the pri- 
mary screening method. Studies have shown that HPV 
testing has a higher sensitivity than cytology, indicating 
that a longer interval between screenings is possible when 
including HPV DNA testing in a screening program 
[ 1 1-14] . Second, HPV DNA detection may be used to tri- 
age women with cytological abnormalities to determine 
whether referral for colposcopy is warranted [15, 16]. 
Lastly, HPV DNA testing may be used as a follow-up to 
detect residual disease or predict recurrence among 
women who have been treated for high-grade CIN [17]. 

The two most common methods used for HPV DNA 
detection are the Hybrid Capture II (HCII, Qiagen Gai- 
thersburg, Inc., MD) and polymerase chain reaction 
(PCR) assays. The HCII assay is commercially available 
and approved for clinical use, and several types of PCR 
assays have been primarily used in the research setting 
[18]. Furthermore, both assays have shown high sensitiv- 
ity to detect high-risk HPV infections but only moderate 
specificity [18]. Meijer et al. [19] have recommended that 
any HPV DNA test should have an optimal balance 
between clinical sensitivity and specificity. Stoler et al. 
[20] proposed a minimum sensitivity of 92% and a speci- 
ficity of 85% for any new HPV DNA test. The selection 
of a screening test is important to detect clinically rele- 
vant cases of HPV infection while avoiding the unneces- 
sary cost, stress, and compromise of the cervix to patients 
associated with overtreating mild cytological abnormali- 
ties. Therefore, the goal of this meta-analysis was to com- 
pare the clinical performance of HCII and PCR assays in 
both the screening and diagnostic settings. 

Methods 

Search strategy 

Between June 2011 and June 2012, an experienced librar- 
ian (H. M. V.) and two investigators (H. N. L. and K. R. 
D.) conducted a systematic search to identify published 
studies from 1985 to June 2012. Three main biomedical 
databases (OvidMedline, PubMed, and Cochrane Library) 
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were searched using the following search terms: ([PCR] 
OR [HCII] OR [Molecular diagnostic techniques]) AND 
([Infections] OR [Papillomavirus infections] OR [Uterine 
cervical neoplasm] OR [cancer cervix] OR [cervical can- 
cer or cervical neoplasm] OR [HPV or papilloma] OR 
[HSIL] OR [LSIL] OR [[high grade] OR [low grade]] OR 
[CIN]). 

Study screening and selection 

Inclusion criteria for the meta-analysis were English lan- 
guage reports of studies comparing the sensitivity and 
specificity of PCR (i.e., MY/PGMY 09/11 or GP5+/6+ or 
Amplicor) and HCII using either cytologic or histologic 
results as the gold standard for testing comparison (i.e., 
Bethesda classification system) in either a screening or 
follow-up/diagnostic setting. The three specific PCR tests 
mentioned above were chosen because they are currently 
the most-used tests. All citations were independently 
reviewed by two investigators (H. N. L. and K. R. D.). 
When necessary, authors of a selected article [21] were 
contacted to obtain further information. Normally, a 
standard threshold of 1 relative light unit (RLU) or 1 pg/mL 
of HCII was used to detect the positive presence of HPV 
DNA. However, to maximize power in our study, we did 
not restrict by this cutoff. 

Data abstraction and coding 

All eligible studies were abstracted independently by two 
reviewers (H. N. L. and K. R. D.) using a coding system 
based on the Standards for Reporting Diagnostic Accu- 
racy (STARD) and MOOSE for meta-analysis of observa- 
tional studies in epidemiology [22, 23]. Any discrepancies 
were resolved by discussion and consensus between the 
two investigators. Variables used to present our analysis 
were grouped into two components, as follows. 

Characteristics of study participants 

Year study conducted, study location, study settings/pop- 
ulation/inclusion-exclusion criteria, sample size, study 
design (i.e., cross-sectional, case-control, cohort, random- 
ized controlled trial), age (i.e., mean, median, range), and 
race/ethnicity. 

Testing methods 

Gold standard tests (cytology or histology); sample prepa- 
ration (cervical collection procedures, including collection 
devices, and DNA preparation for HPV testing methods); 
types of testing methods (PCR and HCII); test results 
(HPV prevalence, sensitivity, specificity, [PPV], negative 
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predictive value [NPV], agreement and level of reproduc- 
ibility [kappa -k], and their respective 95% confidence 
interval, CI); and blinding and/or quality control methods. 

Clinical outcomes 

Three clinical outcomes were examined, atypical squamous 
cells of undetermined significance (ASCUS), low-grade 
squamous intraepithelial lesion (LSIL), which includes 
HPV infection or mild dysplasia (CIN1), and high-grade 
squamous intraepithelial lesion (HSIL), which includes 
moderate (CIN2) and severe dysplasia (CIN3) [24]. 
Because there were only 10 studies on ASCUS, we merged 
ASCUS and LSIL to improve the power of analysis. 

Statistical analysis 

In this meta-analysis, a study unit was defined as a study 
having complete information to compare the testing accu- 
racy between PCR and HCII. Depending on the specific 
PCR test, setting, gold standard, age group, or sample col- 
lection method, one article could contribute more than 
one study unit. For example, an article by Riethmuller 
et al. [25] compared PCR MY09/11 with HCII in two clini- 
cal outcomes (i.e., LSIL and HSIL) and thus generated two 
study units. Likewise, an article by Stevens et al. [26] con- 
tributed eight study units for our analysis because the origi- 
nal analysis included both cytology and histology as gold 
standards with two clinical outcomes (LSIL vs. HSIL) and 
two types of PCR (PGMY09/11 and Amplicor). 

Complete information from each study was extracted 
to construct two-by-two tables, which included true- 
positive, false-positive, true-negative, and false-negative val- 
ues. The sensitivity was calculated as (true positive)/([true 
positive]+[false negative]) and specificity was calculated 
as (true negative)/([true negative]-!- [false positive]). Forest 
plots were generated to present, by type of clinical out- 
come, individual and pooled sensitivity and specificity of 
each test and to show heterogeneity across studies [27, 
28]. Additionally, heterogeneity across studies was exam- 
ined using Cochran's Q-test and the chi-square test [27, 
29]. To examine the threshold effect or the difference 
derived from the use of different cutoffs or thresholds, we 
computed the Spearman correlation coefficient. This 
coefficient can be defined as the result of the logit of 
sensitivity divided by the logit of (1 -specificity) [30]. 

Stratified meta-analysis and meta-regression were used 
to examine the influence of study characteristics and the 
magnitude of interstudy heterogeneity on sensitivity and 
specificity for both PCR and HCII. Stratified meta- 
analyses were performed for the two clinical outcomes 
(ASCUS/LSIL and HSIL) by setting (screening vs. follow- 
up/diagnostic) and by PCR testing technique (i.e., MY/ 
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PGMY 09/11, GP5+/6+, and Amplicor). Age is an impor- 
tant variable, particularly age cutoff of 30 years old; how- 
ever, in our analysis, only two articles reported age using 
this cutoff (Luu, H. N., K. Adler-Storthz, L. M. Dillon, 
M. Follen, and M. E. Scheurer, submitted) [31]. We, 
therefore, decided not to include this variable in the anal- 
ysis because of low power and inability to generate pooled 
sensitivity and/or specificity or perform meta-regression 
[30]. For meta-regression, a generalized linear model was 
fitted to the data and weighted by the inverse of the vari- 
ance using Moses and Littenberg methods [32]. Addition- 
ally, a random effects model was used to pool variation 
between studies [33] in the current meta-regression 
model. For both PCR and HCII, four variables (setting, 
gold standard, type of lesion, and study location) were 
included in the meta-regression models. For PCR, an 
additional variable, type of PCR (i.e., MY/PGMY 09/11, 
GP5+/6+, Amplicor), was added to the meta-regression 
model. Diagnostic odds ratios (dOR) and their respective 
95% CIs were calculated to determine diagnostic test per- 
formance as well as the influence of covariates on test 
accuracy. The dOR is a measure of the effectiveness of a 
diagnostic test. It is defined as the ratio of the odds of 
the test being positive if the subject has a disease relative 
to the odds of the test being positive if the subject does 
not have the disease. The dOR was calculated as (sensitiv- 
ity odds)/(odds of [1 -specificity]) [34]. Meta-DiSc [30], a 
comprehensive software program to evaluate diagnostic 
and screening tests through meta-analysis, was used to 
perform the statistical analysis for this study. All statistical 
tests were two sided and were considered significant at 
the level of P < 0.05. 

Results 

Search results 

We identified 481 citations from search databases, of 
which 259 citations were duplicates (Fig. 1). By examin- 
ing reference lists of those studies, we found an additional 
48 citations. We then excluded 175 citations by applying 
the inclusion criteria to the titles and abstracts and 
retrieved 95 full-text articles for further review. The 
review process yielded 28 articles that met all inclusion 
criteria [14, 21, 25, 26, 31, 35-57]. In addition, we added 
one manuscript from our own group (Luu et al., sub- 
mitted), which is currently under review (Fig. 1). 

Characteristics of included articles 

A total of 22,417 women were described, with four arti- 
cles [38, 41, 48, 54] contributing fewer than 100 partici- 
pants and 10 articles [14, 21, 25, 26, 31, 37, 39, 42, 52] 
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f \ 
Citations outside 

database searches 

(Jt = 48) 
v 



Total citations included 
from database 
(fc = 481) 



Exclusion: 

- Internal duplicates (k = 19) 

- External duplicates (*r = 240) 



Total citations after 
duplicate exclusion 
(fc = 270) 



Full text available for 
review (k = 95) 



Articles meeting all inclusion 
criteria 
(Jr=28) 
Article from our lab (k = 1) 



Citations excluded by 
examining abstracts: 

-Other than English (fc= 1) 

- One test only (<r = 137) 

- Gold standard other than 
cytology/histology (k = 8) 

- No categorization of cervical 
lesion (k = 28) 



Citations excluded by 
examining full text: 

- Other than English (k- 3) 
-One test only (it = 49) 

- Gold standard other than 
cytology/histology [k = 4) 

- No categorization of cervical 
lesion (It = 15) 



(Luu et al., submitted), more than 500 participants. The 
article by Schiffman et al. [42] had the largest sample size, 
with more than 3400 participants. 

Over half of the articles (k = 16) [14, 21, 25, 31, 35, 
36, 39, 43, 45-51, 54] reported results from Europe, with 
the remainder from Asia-Pacific (k = 5) [26, 44, 52, 56, 
57], North America (k = 6) (Luu et al., submitted) [37, 
38, 42, 53, 55], and South America (k = 2) [40, 41]. Just 
under half (k = 14) [25, 38-41, 44-46, 49-53, 57] 
reported studies were conducted in screening settings; 10 
were in a follow-up/diagnostic setting [21, 26, 31, 35, 42, 
47, 48, 54-56] and the remainder (k = 5) (Luu et al., 
submitted) [14, 36, 37, 43] were in both screening and 
follow-up/diagnostic settings. Most (k = 23) used a cross- 
sectional study design [21, 25, 26, 31, 35, 36, 38-41, 44- 
54, 56, 57]; four, a cohort study design (Luu et al., sub- 
mitted) [14, 37, 43] and two, a randomized controlled 
trial design [42, 55]. The age range was 15 to 81 years. 



Figure 1. Screening and eligibility evaluation 
process. 

Half (Jfc = 15) [25, 35-40, 43, 44, 46-48, 51, 54, 55] used 
cytology as the gold standard, 12 (Luu et al, submitted) 
[14, 21, 31, 42, 45, 49, 50, 52, 53, 56, 57] used histology 
as the gold standard, and two [26, 41] used both. 

The 29 included articles contained 82 PCR study units 
and 79 HCII study units (Table 1). The uneven number 
of study units between PCR and HCII resulted from two 
articles [26, 31] that contained studies comparing more 
than one type of PCR with HCII. 

Accuracy of the tests 

In detecting ASCUS/LSIL, PCR was more sensitive 
(Fig. 2A and B) but less specific than HCII (Fig. 3A and 
B) (pooled sensitivity: 0.70 [95% CI: 0.68-0.71], pooled 
specificity: 0.45 [95% CI: 0.44-0.46] for PCR and pooled 
sensitivity: 0.56 [95% CI: 0.55-0.58], pooled specificity: 
0.63 [95% CI: 0.62-0.64] for HCII. PCR GP5+/6+, how- 
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ever, was both less sensitive and less specific than HCII in 
detecting ASCUS/LSIL lesions (pooled sensitivity: 0.51 
[95% CI: 0.45-0.58], pooled specificity: 0.56 [95% CI: 
0.53-0.60]) (Table 2). HCII was more sensitive and spe- 
cific than PCR in detecting HSIL (Fig. 2C and D, Fig. 3C 
and D) (pooled sensitivity: 0.89 [95% CI: 0.89-0.90] and 
pooled specificity: 0.73 [95% CI: 0.73-0.74] for HCII vs. 
pooled sensitivity: 0.85 [95% CI: 0.84-0.86] and pooled 
specificity: 0.62 [95% CI: 0.62-0.63] for PCR). Consistent 
with the overall analysis, HCII showed a much higher 
sensitivity and specificity than PCR GP5+/6+ in detecting 
HSIL in the screening setting (pooled sensitivity: 0.90 
[95% CI: 0.88-0.91] and pooled specificity: 0.77 [95% CI: 
0.77-0.78] for HCII vs. pooled sensitivity: 0.55 [95% CI: 
0.50-0.60] and pooled specificity: 0.58 [95% CI: 0.56- 
0.60] for PCR) (Table 2). 

Heterogeneity and meta-regression 

The Spearman correlation coefficients of HCII and PCR 
were 0.005 (P = 0.96) and 0.234 (P = 0.03), respectively. 
Further analysis showed that besides a threshold effect for 
PCR, both HCII and PCR had heterogeneity due to vari- 
ous factors (HCII: f = 5184.26, iWrogeneity < 0.0001, 
7 2 = 98.5% and PCR: y 2 = 2540.74, iWrogeneity < 0.0001, 
I 2 = 96.8%). 

Setting was not a source of heterogeneity for either 
PCR or HCII accuracy (Table 3). PCR has higher accu- 
racy to detect HSIL than to detect ASCUS/LSIL (dOR: 
5.60 [95% CI: 2.87-10.94], P < 0.0001). PCR MY/PGMY 
09/11 and Amplicor showed 2.75 (95% CI: 1.16-6.53, 
P = 0.02) and 3.01 (95% CI: 1.05-8.63, P = 0.04) times 
higher accuracy in detecting dysplasia lesions than PCR 
GP5+/6+ (Table 3). Also, PCR has higher accuracy in 
detecting dysplasia lesions in Europe than in Asia-Pacific 
Region or North America (dOR: 2.66 [95% CI: 
1.16-6.53], P = 0.03 and dOR: 3.78 [95% CI: 1.50-9.51], 
P = 0.005 [data not shown], respectively). 

For HCII accuracy, we did find that the use of gold stan- 
dard methods was contributed to heterogeneity 
(dOR = 2.87 [95% CI: 1.31-6.29], P = 0.009, comparison 
between histology and cytology, cytology = reference 
group). The HCII is 9.04 times (95% CI: 4.12-19.86) more 
accurate to detect HSIL than ASCUS/LSIL. Similarly, HCII 
had higher accuracy in detecting lesions in Europe than in 
Asia-Pacific Region or North America (dOR: 4.08 [95% CI: 
1.39-11.91], P = 0.01 and dOR: 4.56 [95% CI: 1.86-11.17], 
P = 0.001 [data not shown in Table 3], respectively). 

Discussion 

In the current meta-analysis, we examined the pooled sen- 
sitivity and specificity of HCII and PCR in the screening 
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Figure 2. Forest plots of PCR performance in identifying clinical lesions. 
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Figure 3. Forest plots of HCII performance in identifying clinical lesions. 
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Table 2. Pooled sensitivity and specificity of PCR and HCII by lesion types (Bethesda classification). 



Lesion types 


# Study units 


Pooled sensitivity (95% CI) 


Pooled specificity (95% CI) 


PCR 








ASCUS/LSIL 


35 


0.70 (0.68-0.71) 


0.45 (0.44-0.46) 


MY/PGMY 09/11 


17 1 


0.69 (0.67-0.72) 


0.48 (0.47-0.50) 


Screening 


14 


0.64 (0.61-0.67) 


0.56 (0.55-0.58) 


Diagnostic/Follow-up 


5 


0.75 (0.72-0.79) 


0.33 (0.31-0.35) 


GP5+/6+ 


11 


0.51 (0.45-0.58) 


0.56 (0.53-0.60) 


Screening 


5 


0.47 (0.40-0.54) 


0.62 (0.57-0.66) 


Diagnostic/Follow-up 


6 


0.77 (0.60-0.90) 


0.48 (0.42-0.54) 


Amplicor 


7 


0.73 (0.70-0.75) 


0.36 (0.34-0.38) 


Screening 


3 


0.55 (0.51-0.60) 


0.44 (0.40-0.48) 


Diagnostic/Follow-up 


4 


0.81 (0.79-0.83) 


0.33 (0.31-0.36) 


HSIL 


43 


0.85 (0.84-0.86) 


0.62 (0.62-0.63) 


MY/PGMY 09/11 


25 


0.84 (0.83-0.85) 


0.66 (0.65-0.66) 


Screening 


13 


0.87 (0.86-0.88) 


0.67 (0.66-0.68) 


Diagnostic/Follow-up 


12 


0.82 (0.81-0.83) 


0.65 (0.64-0.66) 


GP5+/6+ 


12 


0.75 (0.72-0.79) 


0.54 (0.52-0.55) 


Screening 


7 


0.55 (0.50-0.60) 


0.58 (0.56-0.60) 


Diagnostic/Follow-up 


5 


0.97 (0.95-0.99) 


0.43 (0.39-0.46) 


Amplicor 


6 


0.94 (0.93-0.95) 


0.38 (0.36-0.40) 


Screening 


3 


0.93 (0.88-0.96) 


0.50 (0.46-0.53) 


Pli^n nnctif/Fnl ln\/\/-i in 
uiciui lujiiuruiiuvv uu 
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HCII 








ASCUS/LSIL 


34 


0.56 (0.55-0.58) 


0.63 (0.62-0.64) 


Screening 


22 


0.46 (0.43-0.48) 


0.71 (0.70-0.72) 


Diagnostic/Follow-up 


13 


0.67 (0.65-0.70) 


0.51 (0.49-0.52) 


HSIL 


43 


0.89 (0.89-0.90) 


0.73 (0.73-0.74) 


Screening 


23 


0.90 (0.88-0.91) 


0.77 (0.77-0.78) 


Diagnostic/Follow-up 


21 


0.89 (0.88-0.90) 


0.70 (0.70-0.71) 



ASCUS, atypical squamous cell of undetermined significance; LSIL, low-grade squamous intraepithelial lesion; HSIL, high-grade squamous intraepi- 
thelial lesion. 

1 The Venturoli et al. study [36] contributes 2 study units in the screening group and 2 study units in the diagnostic group. 



and follow-up/diagnostic settings, following the 2001 
Bethesda Classification (i.e., ASCUS, LSIL, and HSIL). 
We identified 28 published articles and our own manu- 
script that compared HCII and PCR in the same report. 
We found that in detecting ASCUS/LSIL, HCII was less 
sensitive but more specific than PCR. We also found that 
HCII was both more sensitive and more specific in 
detecting HSIL than was PCR (both in screening and 
diagnostic settings). Clinical outcome and study location 
were sources of heterogeneity for the accuracy of both 
PCR and HCII. Additionally, PCR types and gold stan- 
dard were sources of interstudy variability of the accuracy 
of PCR and HCII, respectively. 

To our knowledge, this is the first meta-analysis that 
directly compares the accuracy of HCII and PCR in 
screening and diagnostic settings and across two clinical 
outcomes. In 2004, Arbyn and associates [9] conducted a 
meta-analysis and reported that for ASCUS detection, 
HCII alone was less sensitive but more specific (pooled 
sensitivity: 0.95 [95% CI: 0.93-0.97] and pooled specific- 



ity: 0.67 [95% CI: 0.58-0.76]) than HPV DNA testing 
([all] pooled sensitivity: 0.84 [95% CI: 0.78-0.91] and 
pooled specificity: 0.73 [95% CI: 0.63-0.83]) by PCR. The 
major difference between our study and Arbyn's [9] is that 
they reported the test accuracy of HCII and a combination 
of HPV DNA tests (i.e., HCII, HCI, and PCR). Therefore, 
HCII and PCR were not compared directly. The other dif- 
ference between our studies is that they [9] included test 
results from studies of a single test, whereas our review 
was restricted to studies that compared HCII and PCR. 
Because findings of the test accuracy come from the same 
population, we sought to minimize the source of inter- 
study heterogeneity. Furthermore, Arbyn et al. [9] 
included 17 articles from 1992 to 2002, whereas we identi- 
fied 29 articles from 1999 to 2011. During the 1999-2011 
period, HCII has been used more than it was during the 
period covered by Arbyn, and our meta-analysis included 
only one article [35] that was also in Arbyn's meta- 
analysis [9]. Additionally, the meta-analysis by Arbyn 
et al. [9] was restricted to cross-sectional studies while 
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Table 3. Multivariate meta-regression of PCR and HCII performance. 



Variable 



Coeff. 



Standard error 



P-value 



dOR (95% CI) 



PCR 
Settings 
Screening 

Diagnostic/Follow-up 
Gold Standard 

Cytology 

Histology 
Lesion (dOR) 

ASCUS/LSIL 

HSIL 
PCR types 

GP5+/6+ 

MY/PGMY 09/1 1 

Amplicor 
Location 

Asia-Pacific 

North America 

South America 

Europe 

HCII 
Settings 
Screening 

Diagnostic/Follow-up 
Gold Standard 

Cytology 

Histology 
Lesion 

ASCUS/LSIL 

HSIL 
Location 

Asia-Pacific 

North America 

South America 

Europe 



Ref. 
0.62 

Ref. 
-0.03 

Ref. 
1.72 

Ref. 
1.01 
1.10 

Ref. 
-0.35 
0.42 
0.98 



Ref. 
0.43 

Ref. 
1.05 

Ref. 
2.02 

Ref. 
-0.11 
0.84 
1.40 



Ref. 
0.35 

Ref. 
0.35 

Ref. 
0.34 

Ref. 
0.43 
0.53 

Ref. 
0.50 
0.72 
0.45 



Ref. 
0.38 

Ref. 
0.39 

Ref. 
0.39 

Ref. 
0.55 
0.79 
0.54 



Ref. 
0.08 

Ref. 
0.93 

Ref. 

<0.0001 

Ref. 
0.02 
0.04 

Ref. 
0.48 
0.56 
0.03 



Ref. 
0.25 

Ref. 
0.009 

Ref. 

<0.0001 

Ref. 
0.84 
0.29 
0.01 



Ref. 

1.87 (0.92-3.77) 
Ref. 

0.97 (0.48-1.96) 
Ref. 

5.60 (2.87-10.94) 

Ref. 

2.75 (1.16-6.53) 
3.01 (1.05-8.63) 

Ref. 

0.70 (0.26-1.89) 
1.53 (0.36-6.44) 
2.66 (1.16-6.53) 



Ref. 

1.54 (0.73-3.25) 
Ref. 

2.87 (1.31-6.29) 

Ref. 

9.04 (4.12-19.86) 

Ref. 

0.89 (0.30-2.70) 
2.32 (0.48-11.22) 
4.08 (1.39-11.91) 



ASCUS, atypical squamous cell of undetermined significance; LSIL, low-grade squamous intraepithelial lesion; HSIL, high-grade squamous intraepi- 
thelial lesion; dOR, diagnostic odds ratio. 
Bold values are statistically significant. 



ours expanded to other study designs (i.e., cohort and 
randomized controlled trial). As Sherman et al. [58] rec- 
ommended, a longitudinal study design helps to detect 
missed lesions by repeated cytology before invasive cancer 
occurs. The article by Schiffman et al. [42] (the ASCUS- 
LSIL Triage Study- ALTS), included in our meta-analysis, 
reported that during 2 years of follow-up during which 
study participants were asked to visit at 6-month intervals, 
HCII showed higher sensitivity than and comparable spec- 
ificity with PCR. Finally, while Arbyn et al. [9] restricted 
their analysis to ASCUS, we expanded ours to more 
important clinical outcomes (i.e., LSIL and HSIL). With 
the availability of the test accuracy for these two clinical 
outcomes, misdiagnosis or overtreatment due to screening 
test results can be avoided. 



Type of clinical outcome was a source of interstudy 
heterogeneity in our meta-analysis. Accordingly, both 
tests appeared to have higher accuracy to detect HSIL 
than to detect ASCUS/LSIL. This is expected because of 
the cytological and histological differences between these 
lesions, which are driven by HPV. The other source of 
heterogeneity within HCII studies was the choice of the 
gold standard (i.e., cytology vs. histology). For example, 
an article by Stevens et al. [26] reported that when cyto- 
logy is used as the gold standard, the sensitivity and spec- 
ificity of HCII was 0.87 (95% CI: 0.86-0.89) and 0.47 
(95% CI: 0.44-0.49), respectively. On the other hand, test 
accuracy is different when histology was used as the gold 
standard (sensitivity: 0.79 [95% CI: 0.76-0.82] and speci- 
ficity: 0.56 [95% CI: 0.53-0.59]). Our results showing a 
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higher HCII test accuracy if histology is used as the gold 
standard supports the findings of Sherman et al. [58] that 
a lead time bias occurs if repeat cytology is performed, 
particularly among women with ASCUS or LSIL. Conse- 
quently, one might only detect a smaller proportion of 
CIN3 lesions that do not have sufficient features associ- 
ated with invasive cancer and miss a larger proportion of 
lesions usually associated with invasive cancer [58]. 

Study location is another important source of heteroge- 
neity in our findings. We found higher accuracy of both 
PCR and HCII tests between European and Asia-Pacific 
Region studies and between European and North Ameri- 
can studies. We thought this might be related to the HPV 
type distribution in the different study locations. This 
interpretation is supported by the meta-analysis by Smith 
and associates [59] showing that although 16 and 18 pres- 
ence in all regions, there is difference in HPV specific 
types in different regions, from 16, 31, 33, and 18 in Eur- 
ope; 16, 58, 18, and 51 in Asia-Pacific Region; 16, 18, 31, 
and 35 in North America. 

Results from several large randomized controlled trials 
in Europe [60-62], North America [61, 63, 64], and Asia- 
Pacific [65] supported the use of HPV DNA testing over 
cytology for cervical cancer screening. For example, the 
5-year Population-Based Screening Study Amsterdam 
[62], which included approximately 45,000 women aged 
26-45, reported that HPV cotesting is more sensitive than 
cytology alone to detect baseline CIN2 and 3 and to 
detect cervical cancer at the end of the trial. Another ran- 
domized trial [65] of approximately 132,000 women, aged 
30-59, conducted over 7 years in rural India also reported 
substantially higher sensitivity of HCII over cytology. Our 
analysis showed that while PCR was more sensitive but 
less specific than HCII in detecting ASCUS/LSIL, HCII 
was more sensitive and more specific than PCR in detect- 
ing HSIL. Our findings, therefore, support the use of 
HCII because of its clinical relevance. The 2006 Con- 
sensus Guidelines for the Management of Women with 
CIN or Adenocarcinoma in situ [66] recommended that 
patients with CIN1 preceded by ASCUS or LSIL be 
followed-up with either HPV DNA testing every 
12 months or repeated cytology every 6-12 months. 
CIN1 is heterogeneous in that it may be ASCUS; how- 
ever, it may also include LSIL, ASC-H, or even HSIL 
[67]. Both high-risk and low-risk HPV types may be pres- 
ent in CIN1 lesions [68, 69]. Additionally, several studies 
show that in the absence of treatment there is a high rate 
of spontaneous regression of low-grade cervical lesions 
[70-72] and that CIN1 unusually progresses to CIN2 or 
CIN3 [62, 73]. For example, a study by Moscicki et al. 
[71] reported that in more than 91% of adolescents and 
young women with LSIL, lesions cleared spontaneously 
within 36 months. These findings, together with ours 



support the use of HCII in a cervical screening program. 
This has clinical importance, as a recent report from the 
US Preventive Services Task Force (USPSTF) [74] con- 
cluded that there was insufficient evidence to recommend 
HPV testing for cervical cancer screening. We noticed 
that the estimated accuracy of HCII (HSIL: sensitiv- 
ity = 0.82, specificity = 0.78; LSIL: sensitivity = 0.66, 
specificity = 0.91) from the USPSTF report came from 
the 1999 study by Cuzick et al. [75] of older women. As 
Castle [76] pointed out, the conclusion from the USPSTF 
was reached without the results of randomized, controlled 
trials in Europe and India [60-62, 65], and HPV testing 
in the US was not evaluated. As more evidence accumu- 
lates for HPV testing, the results of our meta-analysis 
could be used as an additional tool for public health 
professionals as they decide the best test for their specific 
cervical screening programs. 

The major strengths of our meta-analysis are the use of 
both the STARD [22] and MOOSE [23] reporting guide- 
lines for study selection, data analysis, and comparison of 
test accuracy, which led to the important condition that 
both tests must be present in the same article. This eligi- 
bility criterion enabled us to minimize a substantial 
source of interstudy heterogeneity. The other strength is 
that our search allowed us to capture articles and studies 
from 1999 to the present, which is the time that HCII has 
been most widely used. The other strength is our inclu- 
sion of other important clinical outcomes that allows 
more conservative application of colposcopy. 

The main limitation in our meta-analysis is that we did 
not include the technique/device for sample preparation 
(i.e., collection of cervical cells and DNA extraction meth- 
ods) and age of the study participants, which could be 
two potential sources of interstudy heterogeneity. The 
large variety of sample collection and preparation meth- 
ods (Table 1) prevented us from establishing meaningful 
groups for a categorical analysis. We also could not 
include the age variable in our meta-analysis because only 
two studies [31] (Luu et al., submitted) provided the rele- 
vant information. 

In summary, we found that while PCR is more sensitive 
but less specific than HCII in detecting ASCUS/LSIL, HCII 
has higher sensitivity and specificity than PCR in detecting 
HSIL, in both screening and diagnostic settings. Given the 
clinical relevance and importance of cervical cancer world- 
wide, our results support the use of HCII in cervical 
screening programs. Also the role of HPV type distribution 
should be explored to determine the worldwide compara- 
bility of HPV test accuracy. While cost of the test has a 
consideration for any screening program, it appears that 
the cost of both HCII and PCR has reduced overtime. Fur- 
ther studies on the cost-effectiveness of HCII over PCR in 
a cervical screening program are, therefore, warranted. 
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